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Abstract — We study a class of robust network design problems 
motivated by the need to scale core networks to meet increasingly 
dynamic capacity demands. Past work has focused on designing 
the network to support all hose matrices (all matrices not 
exceeding marginal bounds at the nodes). This model may be 
too conservative if additional information on traffic patterns is 
available. Another extreme is the fixed demand model, where 
one designs the network to support peak point-to-point demands. 
We introduce a capped hose model to explore a broader range 
of traffic matrices which includes the above two as special 
cases. It is known that optimal designs for the hose model are 
always determined by single-hub routing, and for the fixed- 
demand model are based on shortest-path routing. We shed 
light on the wider space of capped hose matrices in order to 
see which traffic models are more shortest path-like as opposed 
to hub-like. To address the space in between, we use hierarchical 
multi-hub routing templates, a generalization of hub and tree 
routing. In particular, we show that by adding peak capacities 
into the hose model, the single-hub tree-routing template is no 
longer cost-effective. This initiates the study of a class of robust 
network design (RND) problems restricted to these templates. 
Our empirical analysis is based on a heuristic for this new 
hierarchical RND problem. We also propose that it is possible to 
define a routing indicator that accounts for the strengths of the 
marginals and peak demands and use this information to choose 
the appropriate routing template. We benchmark our approach 
against other well-known routing templates, using representative 
carrier networks and a variety of different capped hose traffic 
demands, parameterized by the relative importance of their 
marginals as opposed to their point-to-point peak demands. This 
study also reveals conditions under which multi-hub routing gives 
improvements over single-hub and shortest-path routings. 

I. Introduction 

Traditional communication networks are designed based on 
knowledge of an expected traffic demand matrix that specifies 
the aggregate traffic between every pair of nodes in the 
network and evolves slowly, over a time scale of months or 
years. In an era of data-dominant communication networks 
with dynamic demand patterns, this traditionnal network de- 
sign methodology loses its cost-effectiveness. For instance, in 
emerging data services such as YouTube, user generated video 
content and flash crowds are triggering an increasing amount 
of dynamic uncertainty in the traffic demand. One specific 
concern is that the real-time estimation of the dynamically 
changing traffic patterns in large data networks is intractable. 

'This author's work was done while studying at McGill University, and 
with generous funding from NSERC Discovery Grant No. 212567. 



Moreover, a pessimistic approach of designing for peak point- 
to-point demands (also referred to as the fixed-demand model) 
throws away critical information for achieving "cost-sharing" 
between the varying demand patterns that arise over time. 

A traffic model that has gained considerable popularity in 
coping with dynamic demands is the hose model 0D, El- In 
this model, only bounds on the ingress/egress traffic of the 
nodes are known. These bounds (called marginals) are physi- 
cally represented by the various interface speeds of networking 
hardware at any given node. The actually encountered point- 
to-point traffic distributions are subject to dynamic variations 
during network operation, but the network is provisioned to be 
able to support any possible pattern meeting the ingress/egress 
bounds. Formally, in the symmetric (undirected) demand case, 
the space of hose matrices Hu consists of symmetric matrices 
D satisfying the bound: < U(i) for each node i, with 

marginal U(i); in addition, the main diagonal of these matrices 
is typically set to zero, reflecting the fact that nodes generally 
do not send traffic to themselves. However, the flexibility of 
the hose model can also be a drawback since totally random 
demand matrices rarely ever occur in practice, and the operator 
usually has additional at least some a priori information about 
the traffic pattern. For instance, it may happen that there exists 
some additional peak demand U(i,j) between nodes i.e. 
Dij < U(i,j). We call this union of marginals and peak 
demands the capped hose model. Depending on the choice 
of U(i, j), the space of traffic patterns spanned by the capped 
hose model ranges from purely deterministic point-to-point 
demands (Dij = U(i,j)) to the space of unconstrained hose 
traffic (U(i,j) — mm(U(i),U(j))). Designing a network 
for the capped hose model hence avoids the massive over- 
provisioning that may arise for unconstrained hose traffic 0. 

In order to solve the routing problem for the capped hose 
model, we propose the use of a class of oblivious routing 
strategies, called hierarchical hubbing routing templates (HH), 
which generalizes the popular hub and tree routing templates. 
We give empirical results based on a heuristic for the corre- 
sponding new class of robust network design (RND) problems. 

In order to quantify the extent of the space of traffic matrices 
spanned by the capped hose model, we propose indicators 
(marginal versus peak strength measures) that could be used 
as a predictor for which traffic scenarios favour a shortest path 
routing (SP), and which favour HH routings. We also show 
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that HH is often superior in general to both SP and hub routing 
(HUB), in terms of network design (considering both link and 
node costs). 

A. Summary of Main Contributions 

1) We systematically study the capped hose model as a gen- 
eralization of the fixed-demand and the unconstrained 
hose models. This allows tapping into a whole range of 
practically relevant traffic scenarios. 

2) We initiate a formal study of a class of RND problems 
based on oblivious HH routing templates, a general- 
ization of tree and single-hub routing used by optimal 
virtual private networks (VPNs). We provide theoretical 
examples showing that SP is much cheaper than HH, 
and vice-versa. 

3) We perform an empirical study (based on both randomly 
generated and real traffic data) on the space of new 
capped hose traffic polytopes. 

4) Our initial empirical findings show that for most in- 
stances, HH is better than single-hub VPN, suggesting 
that multi-hubbing at different layers can be important 
for network cost savings. Empirical results suggest that 
for many instances HH is significantly better than SP. 

5) We provide some initial understanding of which traffic 
scenarios favour SP-like, and which are more HH-like. 
This is achieved by introducing two metrics that quantify 
the strengths of the marginals and peak demands in the 
specification of a collection of traffic matrices. These 
indicators could be used as a predictor for which of the 
two routing templates, SP and HH, is better. 

II. Traffic Engineering for Core Networks 
A. Oblivious Routing and Routing Templates 

Apart from the traffic modelling complexities mentioned 
above, which form the basis for network infrastructure plan- 
ning and design, there is a second operational complication 
arising in any network with dynamically changing traffic pat- 
terns: how should traffic be routed on an installed infrastruc- 
ture? Since centralized or distributed control planes may not 
have the speed and flexibility to adapt dynamically to rapidly 
varying traffic patterns without the risk of inducing detrimental 
oscillations 2), the vast majority of today's networks are 
designed based on the principle of oblivious routing, usually 
based on SP, both on the circuit and on the packet layer. 

Informally, Valiant [5| defined oblivious routing as "the 
route taken by each packet be determined entirely by itself. 
The other packets can only influence the rate at which the 
route is traversed." Formally, this amounts to specifying a 
template: for each possible communicating pair of nodes i, j, 
one specifies a path [^] Traffic from node i to j is always 
routed on P. L j independent of any other demands or congestion 
in the network. Oblivious routing is attractive because it 
enables strictly local routing decisions. In contrast, stable non- 
oblivious routing would require a global network control plane 

2 Valiant actually considered the randomized version and so the flows could 
be fractional. 



performing path optimization on a time scale much faster 
than the congestion and traffic variation dynamics, which is 
impractical in most scenarios. 

For the fixed-demand traffic model, a minimum cost de- 
sign is achieved using an oblivious SP routing template. 
Using SP for hose traffic, however, may result in significant 
over-provisioning of network resources (if blocking is to be 
avoided)!!]]. On the other hand, it was proved in [6| that 
the optimal design (in the undirected setting) for the hose 
model is always induced by a tree routing template (i.e. there 
exists a network-specific tree, and the template is to use the 
path Pij in that tree between node In Q, (7) a simple 
polynomial-time algorithm was given to produce the optimal 
tree template. Their analysis shows that the resulting network 
(called a virtual private network - VPN) actually has enough 
capacity to support hub routing (HUB) within the tree. That 
is, there is a hub node h and enough capacity in the tree for 
each node to reserve a private circuit of size U(i) to h. We 
use HUB to denote such routing templates. 

B. The Capped Hose Model 

We now consider the class of capped hose matrices and 
the minimum cost network to support it. 

• If the node bounds U(i) are sufficiently large (at least 
Y^j U(i, j)), then they impose no constraints on the 
routed demand. Hence the network is only required to 
support the single fixed-demand matrix U(i,j). In this 
case, we know that an optimal design results from SP 
routing. 

• If the point-to-point U(i,j) are large (at least 
min{Z7(i), U(j)}), then they impose no constraint. Hence 
the network must support all hose matrices associated 
with the marginals U(i). In this case, we know that HUB 
routing is optimal. 

• In the general case where neither marginals nor peak 
demands clearly dominate the space of possible traffic 
matrices, we see that there are gaps at the two ends of 
the spectrum: at the fixed-demand end, we may incur 
large penalties if we design the network via hub routing. 
Similarly, at the unconstrained hose-demand end we may 



incur large penalties if we use SP (see Section IV-D i. 

One would normally assume that a node's marginal is big 
enough to allow the routing of any of its peak demands. Hence, 
we call [maxj U(i, U(i,j)] the relevance interval of a 

marginal U(i) for a given set of peak demands. A similar 
reasoning for peak demands suggests that U(i,jY& relevance 
interval is [0, mm(U(i), U(j))]. 

In order to understand the spectrum of traffic matrices 
between these extremes it is necessary to consider more 
general oblivious routing strategies. To solve this problem, we 
propose the use of HH (see Section [TV}. On the other hand, 
to characterize when we are close to the extremes and can 
rely on standard HUB and SP routings, we introduce a pair 
of metrics. 
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C. Marginal and Peak Demand Strength Metrics 

For any capped hose model, we propose two vector mea- 
sures 7r (for peak) and /i (for marginal) to indicate the rele- 
vance of the different traffic measurements in defining a given 
collection of traffic patterns. For example, if is large, it 
means that the marginal U (i) should play a significant role. On 
the other hand if is large, the peak demand U(i,j) has 

a more significant role in determining a cost efficient routing 
template. One aspect of this study is to examine whether these 
7r and fi measures can be used to classify traffic instances as 
being more favourable for a SP or hub-like routing template. 

Given marginals and peak demands, we let the strength of 
node i's marginal be: 



tx(i) = 1 



trunc([/(i)) — maxj U(i,j) 
HjU(i,3) - max j U(i,j) ' 



where trunc([/(i)) = min(Z7(i), J^- U(i,j)). Similarly, the 
strength of the peak demand between two nodes i,j is: 



trunc([/(i, j)) 
mm(U(i),U(jY 



where trunc(J7(i, j)) — mm(U(i,j),mm(U(i),U(j))). 
These metrics are faithful to our observations and range from 
(weak) to 1 (strong). 

We thus define our strength vectors fj, E R™ for marginals 
and 7r 6 K™ xn for peak demands, with \n = and = 
7r(z, j) respectively. A capped hose instance that is hose-like 
should have relatively high marginal strengths, and if it is 
peak-demand like it should have high peak demand strengths 
if it is peak-demand-like. Based on this intuition, we attempt 
to classify a capped hose instance by the Euclidean norm of 
these vectors. 

D. Hierarchical Hub Routing 

It is well-known that designing a network for hose matrices 
is quantitatively similar (e.g., up to a factor 2 in capacity) 
to designing for a single "uniform multiflow" instance. Since 
uniform multiflow instances can be viewed as a convex com- 
bination of hub routings (Q, 0), it seems natural that one 
of these hubs would deliver a good overall network design 
in terms of link capacity cost. However, in the capped hose 
model, we may "punch holes" into the space of demand 
matrices by letting some U(i,j) be much smaller than others. 
This allows for the existence of certain regions within which 
there is much denser traffic than between such regions. Since 
these regions have their own capacity sharing benefits, one 
should no longer expect a single hub, but multiple hubs, each 
serving its own region of dense traffic. 

This forms one motivation for our choice of hierarchichal 
hub routing templates (HH). Each HH template is induced by 
an associated hub tree. If the hub tree is a star, it corresponds 
to standard hub routing. If the hub tree has more layers, 
its internal nodes represent possible hub nodes for different 
subnetworks (these subnets are necessarily nested due to the 



E. The Robustness Paradigm 

We are given an undirected network topology with per- 
unit costs of reserving bandwidth. In addition we are given 
a space of demand matrices (in our case, the capped hose 
model) which the network must support via oblivious routing. 
This space, or "traffic model", is meant to capture the time 
series of all possible demand matrices which the network 
may encounter. The goal is to find a routing template which 
minimizes the overall cost of reserved capacity needed to 
support all demands in the traffic space. The formal definition 
of robust network design (RND) appears in Section [TIT] 

Due to the optimality arguments of various routing tem- 
plates for the traffic models under consideration, we focus 
on RND restricted to the routing templates SP, HUB, TR 
(tree routing) and HH. For a class of templates X, we denote 
by RNDx the RND problem where one must restrict to the 
corresponding class of templates. While RNDsp, RNDtr and 
RNDjjub have been well-studied, we initiate a formal analysis 



tree structure). A detailed description is given in Section IV 



of RNDhh in Section IV We also propose a heuristic for this 
version in Section IIV-AI 

F. Related Work 

Oblivious routing approaches to network optimization have 
been used in many different contexts from switching ((H, 0, 
1 10]) to overlay networks dill . lfT2lO . or fundamental tradeoffs 
in distributed computing (5), lfl3l to name a few. In each case 
mentioned above, the primary performance measure is network 
congestion (or its dual problem throughput). In fact, one could 
summarize the early work by Valiant, Borodin and Hopcroft 
as saying that randomization is necessary and sufficient for 
oblivious routing to give O(logn) congestion in many packet 
network topologies; n being the number of nodes. 

The present work's focus is not on congestion, but on total 
link capacity cost. This falls in the general space of RND 
problems, which includes the Steiner tree and VPN problems 
as special cases. Exact or constant-approximation polytime 
algorithms for several other important traffic models have also 
been designed, e.g. the so-called symmetric and asymmetric 
hose models lfl4l . |fT31 . The latter is however NP-hard to ap- 
proximate in undirected graphs within polylogarithmic factors 
for general poly topes fl5l . 

Our work was partly motivated by work in [3|, where 
Selective Randomized Load Balancing (SRLB) is used to 
design minimum-cosf networks. They showed that networks 
whose design is based on oblivious routing techniques (and 
specifically SRLB) can be ideal to capture cost savings in 
IP networks. This is because optimal designs reserve capacity 
on long paths, where one may employ high-capacity optical 
circuits, and partially avoid expensive IP equipment costs at 
internal nodes (their empirical study incorporated IP router, 
optical switching, and fiber costs). In this paper, instead of a 
design based on hub routing to a small number of hubs, we 
consider more general HH, as discussed in fl5l . 

Our work also extends the long stream of work on designing 
VPNs fl], Q, @, Eg], El. Previous work has focused on 
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provisioning for a VPN based on either the hose or the fixed- 
demand models (c.f. Ifl8l ). Designing for more general traffic 
models in this context has not received much attention. We 
take an intermediate step by examining the capped hose model 
which to the best of our knowledge has not been studied. 

With some thought, the new class of RNDhh problems is 
almost equivalent to solving RND but only using tree routing 
in the metric completion of the graph. The latter approach is 
exactly the idea used by Anupam Gupta (unpublished) to show 
an O (log n) -approximation for general RND via metric tree 
embeddings (cf. a long version of [19| contains full details). 

III. The Robust Network Design (RND) Model 

To understand the hierarchical hub routing problem we 
first introduce the robust network design paradigm in full 
generality. 

A. Formal Definition 

An instance of RND consists of a network topology (undi- 
rected graph) G — (V, E), as well as per-unit costs c(e) of 
bandwidth on each link e 6 E (the edge costs may be implicit 
if G is weighted). In addition, we are given a convex region 
V C M VxV j^]The region T> represents the traffic model or 
demand universe, that is, the set of demands which have to be 
supported. In other words, any matrix Del) must be routable 
in the capacity we buy on G. The cost of the network increases 
as this space grows. 

One may define several versions of RND depending on 
assumptions on the routing model. Most common in practice 
(and the focus of this paper) is the oblivious routing model 
described above. These paths often need to satisfy additional 
requirements, such as be shortest paths in SP or form a tree 
in TR. This is specified by a class of template X. 

Our main objective in the context of RND is then to 
compute an oblivious routing template in a specific class 
of templates which leads to a link capacity vector cap e 
whose overall cost is minimized (as we discuss next). We use 
RNDx(G, c, T>) to denote the cost of an optimal solution 

Given a fixed routing template V = (Pjj : V i,j), the cost 
of supporting all demands in the demand poly tope P can be 
computed in a straightforward fashion. For each edge e e 
E(G), we compute a "worst demand" matrix in P in terms 
of routing on e (using the routing V). This gives rise to the 
following sub-optimization: 

cap e (P,2?) :=max £ £> y . (1) 

i,j£V: 
e£P,j 

RND then asks for a template V that minimizes the cost of 
the overall required capacity: 

RND x (G,c,D)=mh £ c(e)cap e (V,V). 

eEE 

3 We always assume that T> is itself well-described in the sense that we can 
solve its separation problem efficiently, e.g. in polytime. 




Fig. 1. A hub tree. A "region" of terminals is represented by a dashed circle. 
Internal black nodes of the tree are the ones that will be mapped to hubs in 
the network topology. White leaf nodes are terminals. 

B. The Hierarchical Hub Routing Template (HH) 

We make heavy use of the generalization HH [15|. Such 
oblivious routing templates are derived from a tree T whose 
leaves consist of the terminal nodes, that is V(G) without loss 
of generality. We call this the hub tree. The internal nodes are 
"abstract" in the sense that they represent nodes (yet to be 
identified in G) which will act as hubs for certain subsets 
of terminal nodes. For instance, for any non-leaf node v E 
T, let T v be the subtree rooted at v dangling below it. The 
interpretation is that the leaves L v in T v represent a region of 
terminal nodes which should have a dedicated hub (this hub 
will correspond to v). The family of regions arising from a 
tree obviously forms a nested family (Figure 1). 

Obtaining an oblivious routing template from the HH class 
is a two step process. First, we first need to pick a hub 
tree, and then map its internal node to physical nodes. In 
this second stage, we find a map from each internal node 
v to a real node in G to act as a hub. Such a hub map 
77 : V(T) —> V(G) must satisfy i](v) = v for each leaf of T. 
Given such a mapping, there is an induced oblivious routing 
template defined as follows: for each pair of leaf nodes i,j in 
T, let i — vq , vi , V2 , ■ ■ ■ , vi — j be the unique ij path in T. For 
each q = 0, 1, . . . , I — 1, let P q denote a shortest 77(^)77(^+1) 
path in G. Then we define to be the concatenation of paths 
Pq, Pi, . . . , Pi-x (resulting in a possibly non-simple path). 
Naturally, the special case of HUB routings corresponds to 
taking T equal to a star. Each hub tree thus gives rise to many 
possible routing templates, one for each mapping 77 (analogous 
to the specific choice of hub node in hub routing). Call this 
the class of hierarchical hub templates. Note that, in some 
scenarios, there are single-path oblivious routing templates that 
are unachievable by hierarchical hub templates. For instance 
in a complete graph on four nodes, with unit weight edges, 
there is no hierarchical hub template which corresponds to the 
shortest path template (each consist of the edge ij). 

HH routing also responds to one practical impediment of 
hub routing architectures, pointed out in J3]. Namely, network 
providers are generally opposed to having all traffic routed 
via a single hub (or a small cluster of hubs) in the center of 
a large network. In particular, traffic within some local region 
should be handled by hubs within their own "jurisdiction". 
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Hub trees potentially give a mechanism for specifying these 
local jurisdictions. 

IV. The Optimization Problem RND H h 

There are several layers to the optimization problem 
RNDhh- First, we must find a hub tree T - this is the core 
decision for which we provide a heuristic algorithm in Section 



IV-A Second, given a hub tree, we must map it to the network 



graph to obtain a valid routing template. An efficient solution 
to this mapping problem is known for T-tope^][15]; we show 
that this can be invoked to solve the problem for general 



polytopes in Section IV-B Finally, given a template, we must 
ultimately solve for capacities it induces, as we discuss in 
Equation OJ below. 

We mention that the case where the hub tree is given 
is of interest in cases where a network provider wishes to 
self-identify a regional clustering within their network. In 
other words, they provide the hub tree, and it only remains 
to identify which nodes to act as hubs (i.e. solve the hub 
placement problem). 

In Section IV-C we discuss several classical problems 



which arise as special cases of RNDhh- Finally, we provide 
some theoretical bounds between RNDhh and RNDgp in 
Section fN-D\ 

A. Choosing the Hub Tree 

We now develop a heuristic for RNDhh- It is tailored for 
the capped hose model, but could be extended to handle any 
traffic polytope. The high-level intuition is to group nodes so 
that traffic within a group is large compared to traffic leaving 
the group. This suggests the need for a hub to handle the 
group's local traffic. It is straightforward to show that there 
is always an optimal hub tree for RNDhh where the tree is 
binary (we defer the details to the end of this section). Hence 
our algorithm's focus is to find a "good" binary hub tree. 

To construct a binary tree from the terminal nodes V, one 
could either proceed by a top-down approach (recursively 
splitting groups of nodes in two) or a bottom-up approach 
(recursively merging nodes together). The former has the 
flavour of repeatedly solving SPARSEST CUT problems. This 
problem is APX-hard and the current best approximation 
factors are polylogarithmic GUI . EH . Instead, we follow the 
bottom-up approach. We are thus seeking to repeatedly merge 
pairs of node sets which share a lot of traffic with each other. 
The key to this calculation is a sparsest cut-like measure which 
we introduce next. This is then used in a Binary Tree Sparsest 
Merging (BTSM) algorithm. 

A SPARSITY MEASURE 

We devise a sparsity measure that seeks to maximize the 
traffic between two groups, compared to the traffic sent outside 
the merged groups. This measure is very similar to the capacity 



on a fundamental cut explained in Section III-A 



4 A T-tope is a polytope of demands, denoted by Ht b< which are routable 
on some tree T with edge capacities b. This class generalizes hose polytopes, 
which can be viewed as demands routable on a star. 



In the setting of the capped hose model, the maximum 
possible demand between a pair of disjoint sets of nodes 
A, B C V is given by 



u*{A,B) 



maximize 
subject to 



for all i, j 
D H < U{i) for all i . 
Da = for all i 
D > 

This problem can be solved efficiently, since it can be cast 
as a 6-matching or max-flow problem. For instance, consider 
a bipartite graph H with bipartition A,B. Each edge ij has 
a capacity of 11™ and is oriented from A to B. We also add 
a source s with edges (s,i) for i E A with capacity U(i). 
Similarly we add sink t with edges from B. A max si-flow 
in this graph gives precisely the value u*(A,B). 

The sparsity of a pair of disjoint sets of nodes A, B C V 
is then: 



sc{A,B) 



u*(AU B,V\(AU B)) 



u*{A,B) 

One notes that it trades off the flow out of A U B versus the 
flow between A,B. 

The Sparsest Merging Algorithm 

Our proposed heuristic algorithm (see Algorithm 1) builds 
up a binary hub tree T, starting from a forest consisting of 
a set of singletons. At each step it has a forest and looks for 
a pair of rooted trees that has the minimum sparsity value. 
Specifically, it computes the sparsity measure for the leaf sets 
for such a pair of trees. The two subtrees are then merged, 
that is, a new root node is added to T and becomes the parent 
of the two previous rooted subtrees. When the forest becomes 
a single tree, we stop. Clearly this can be done in polytime. 

Algorithm 1 Binary Tree with Sparsest Merging algorithm. 
Require: A set of nodes nodes V with peak demands U(i, j) 
for each pair of nodes i, j £ V and marginals U(i) for each 
node i 6 V. 
Ensure: A binary demand tree T. 
T=(V,0) 
S <- V(T) 
while \S\ > 1 do 

Find i, j e S that has minimum sparse cut value. 

Add a new node u to T. 

Connect u to i and j. 

Add u to S. 

Remove i and j from S 
end while 
return T 

Restricting to Binary Trees. 



In this section, we show that there is always a binary hub 
tree that is optimal for RNDhh- 

Let T be any hub tree that has at least one node u of degree 
at least four. Then it is possible to form a new hub tree T' with 
chh{T') < chyi(T), and such that the sum of the degrees of 
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Fig. 2. Dividing internal nodes to get a binary tree. 

nodes with degree greater than four is smaller in T'. Hence, 
repeating the process eventually reduces the number of degree 
at least four nodes to zero, and we end up with a binary tree. 

Consider a u of T with degree at least four. Replace u by 
two nodes v! and u" such that it's old parent is now connected 
to u' , and v! is connected to it's first child as well as to u". 
Then u" is connected to the remaining children of u (see 
Figure 2). Note that the sum of degrees of nodes with degree 
at least four is decreased by one, since u' has degree three 
and u" has degree deg(u) — 1. 

The capacity on the edge between it's parent and v! is 
equal to the capacity between u and its parent; the capacity 
between vt and u" can be inferred using equation Q. All 
other capacities remain the same - the capacity between one 
of it's children and its new parent v! or u" is the same as the 
capacity between u and that child in T. Note that we only split 
u into two nodes and added enough capacity between those 
two nodes to route any feasible traffic matrix. Hence we have 
the new capacitated T' can support exactly the same demand 
matrices as T (i.e. the T'-tope and T-topes are the same). 

Finally, an optimal hub map of T can be implemented for T' 
as follows. One maps v! and u" to the node that u is mapped 
to. Hence, T' can produce the same oblivious routing as T, 
so its optimal cost chh(T') is at most chh(T). 

B. The Hub Placement Problem 

Suppose we are given a network graph G, hub tree T with 
leaf set V(G) and edge capacities it. In fl5l . a dynamic 
programming algorithm is given to find a valid mapping 
rj : V(T) — > V(G) which minimizes the hub tree's capacity 
routing cost c HH (T) = T ie =wveT u(e)dist G (r)(w), r)(v)), 
where dist(j(s, t) is the length of a shortest si-path in G with 
edge costs c. We now describe how this is used for general 
hub placement in RNDhh(G, c, T>). 

Consider any demand between If e = wv lies on the 
unique ij path in T, then this demand ultimately routes on 
a shortest path between T](w),rj(v) in G. Hence if there are 
demand matrices that route X units of flow on e in T, then 
we need to reserve X units of capacity on a (presumably 
shortest) rj(w)r)(v) path in G. In other words, the RNDhh hub 
placement problem reduces to the dynamic program mentioned 
above where the u(e) values are the solution to the following 
problem: here, for e e T, fund(e) denotes all terminal pairs 
i, j whose path in T uses edge e (i and j are separated by the 



fundamental cut induced by e). If V is the demand polytope, 
then the maximum amount of demand across edge e is 

u-D,T(e) := max } D tj . (2) 

i,j£fund(e) 

Thus a tree T plus the polytope T) induce some maximum 
capacities on the edges in T, These represent how much 
capacity we need to install on each "shortest path r)(w)rj(v)" 
to route V. These in turn drive the dynamic program for 
placing the hubs. Note that we have chosen Ut>,t to be the 
smallest capacity vector u such that T> C Ht,u- I n other 
words, designing a network to support the T-tope Ht,ut, t 
also supports T>. 

Under an arbitrary mapping rj : V(T) — > V{G), we must 
reserve uv,t{s — vw) units of capacity for the circuit on 
the shortest path between rj(v),rj(w) in G. Hence the triple 
T, T>, r\ induces a total capacity cost of 

chhIT) = ^ uv,T(e)dist G (r](v)r](w)). (3) 

e=uv£T 

We refer to the hub placement problem as determining the 
map 77 which minimizes Q. In lfT31l a dynamic programming 
algorithm is given to find the optimal mapping efficiently. 
This forms the basis of their C(l)-approximation for the 
Generalized VPN Problem. Namely, the general RND problem 
for T-topes. 

C. RNDjjh an d Other Algorithmic Problems 

RNDhh contains well-studied algorithmic problems as spe- 
cial cases. First, if T> = Hu is the class of hose matrices, then, 
as mentioned earlier, an optimal solution to RND is induced 
by a hub routing template, which is in turn induced by a star 
hub tree. Hence in this case RND H h = RND H ub = RND. 

Second, consider the case where T> consists of a single 
demand matrix D. Obviously, the optimal solution is induced 
by a SP template. RNDhh however, asks for an optimal 
HH templates. This is a classical problem in combinatorial 
optimization called the minimum communication cost tree 
(MCT). Find some tree (not necessarily in G), containing 
V(G), such that if we route the demands D on T, the overall 
routing cost is minimized^] Being allowed to use a hub tree for 
the routing, as opposed to a subtree of G, essentially means 
that we are restricting to MCT with metric costs^] For the 
special case where G is itself a complete graph with unit-cost 
edges, Hu 1231 showed that an optimal solution is induced by 
a so-called Gomory-Hu tree on the complete graph with edge 
capacities -Dij j^] 

As we now see, RNDhh may incur a logarithmic increase 
over RND, and in fact RNDsp. On the positive side, it is 
known that in general there is a tree in the metric completion of 
G whose "distortion" is at most O(logn) [24|. The approach 
of Gupta thus yields an O(logn) approximation for RNDhh- 

5 This is directly related in turn to the average stretch tree problem \22\. 
6 Costs are metric if they are induced by a cost function that is a metric. 
7 In general, the Gomory-Hu tree for a capacitated graph need not be subtree 
of the graph; hence the requirement that G is complete. 
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One can see this by noting that RNDhh costs no more than 
a best tree routing on T in the metric completion. To see this, 
define a hub tree T" by hanging a leaf edge vv' from each 
node of T. Then consider the mapping where each v, v' maps 
to v G G. This establishes: 

Fact 1. For any instance of RND, 

RNDhh < RND TR . 

While these techniques yield an (log n) -approximation for 
RNDhh and hence also MCT, it cannot be ruled out that 
there are polytime 0(l)-approximations. For the purposes 
of our study, we use a heuristic algorithm instead of metric 



embeddings. This is outlined in Section IV-A 



D. Basic Results for RNDhh 

In this section we show that RNDgp and RNDhh are 
incomparable; depending on the instance, one may be signif- 
icantly cheaper than the other. This motivates our main goal 
of classifying the demand spaces according to which template 
is better. 

We now establish a family of RND instances where 
RND 

hh does much worse than RND, and in fact worse than 
RND SP . 

Theorem 1. There is a sequence of (unit-cost) graphs 
{GnjngN on n nodes and demand polytopes {2?„} n eN such 
that for these instances 

RND HH (G„) = fi(lognRND SP (G n )). 

Proof. We describe G = G n and P = P n for a fixed n. The 
network graph G consists of a d-regular, a-expander graph 
on n vertices, where d — 0(1) and a — f2(l) (see [25] 
for existence proof). Directly we have that |-E(G)| = 0(n). 
Moreover, all edges have unit cost. 

The demand polytope T> consists of a single matrix: one 
unit of demand between the endpoints of any edge in G. The 
RNDgp solution then consists of the network itself (route each 
demand on its own edge). Its cost is 0(n), the number of edges 
in the graph. Let T be any hub tree. We show that the cost of 
any HH solution induced by T is Sl(rtlogn). Hence, RNDhh 
is Q,(n\ogn). 

Let c e V(T) be a center of the tree, that is each component 
of T\{c} has at most S leaves (it is well-known, and easy to 
see that every tree has a center). We call an edge e 6 E(G) 
separated by c in T if its endpoints lie in different components 
of T\{c}. We let E c C E(G) be the set of all edges separated 
by c in T. Let Ci be one of the components in T \ {c} and let 
£(Ci) be the leaves of C % . We know that |I(G;)| < § . Hence, 
since G is an a-expander, there is at least a|£(G,)| edges going 
out of Gj. So we have that \E C \ (or the total number of edges 
between the Gj's) is at least 



|£ c |> ^£"1^)1 = 



a \ 
2 ^ 



an 

IT 



Let r) : V(T) -» V(G) be the optimal hub map for T. Call 
a node v G V(G) close to c if 

dis M,<, = »,,<»)>< ^^H. 

Let V c be the set of close vertices in G. Since G is a d-regular 
graph, its size is bounded by 

IVJ < d lo ^( min ihfa» = mini-, — }n. 

We now merge the two concepts and say that an edge e = 
xy € E c that is separated by c in T is good if both x, y £ V c , 
i.e. x, y are close to c. Conversely, an edge e = xy € E c is 
bad if one of its endpoints is not close to c in T. Let B be 
the set of bad edges. By the <i-regularity of G, a bound on the 
number of bad edges is given by 

dan 



B > E n 



d\V c \ an 
2 - ~2~ 



a 

~4d ~ X 1 ' 



Since the bad edges are unit demands that must be simultane- 
ously routable in T, the "non-close" endpoint of these demand 
edges must route through the image 77(c) of c, imposing 
a cost of at least log d (n). We have Q(n) bad demands, 
each incurring a cost of O(logn), and thus the cost of the 
optimal Hierarchical Hubbing induced by T routing is 
fl(n\ogn). □ 

The next result shows that RNDtr may be much cheaper 
than RNDgp. Hence, we can get an unbounded gap between 
RNDgp and RND TR . By Fact 1, this implies that RND H h 
may also be much cheaper than RNDgp. 

Theorem 2. There is a sequence of weighted graphs {G„} n6 N 
and demand polytopes {2?„}„£N such that 



and 



RNDgp (G n ) = n(n 2 ) 



RND TR (G„) =0(1). 



as = n ( me leaves of T consist of the nodes of 

G). 



Proof. The network G — (V, E) (actually G2 n +2) consists of 
two stars centered at nodes a, b connected by an edge. The 
leaves are with the following edges: 

E = {av t : 1 < i < n}U 
{bv' t :l<i< n}U 
{viVj :l<i,j< n}U 
{ab}. 

The edges are weighted as follows: the bridge edge ab has 
weight one, the star edges avi or bv[ for 1 < i < n have 
weight and the edges v^v'a for 1 < i,j < n have weight 
1 - A. 

n 

The (symmetric) demand polytope V consists of convex 
combinations of unit demands between Vi and v'j. That is, for 
each i. j let Z? 4J be the matrix with D v v i = D v > v . = 1 and 
all other entries 0. Then V = conv(D iJ ). 

The optimal shortest path template in the network consists 
of routing the demand between Vi and v'j through the edge of 
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cost 1 — i connecting Vi and . This template induces a cost 
of RND s "p = n 2 (l - i) = n 2 - n = f7(n 2 ). 

Now consider the tree template induces by the union of the 
two stars. In order to support T>, the sub-optimization problem 
only reserves one unit of capacity on the bridge edge ab, as 
well as one unit on all star edges. Hence, this capacity cost is 
0(1), and is an upper bound for the best tree solution. □ 

V. Evaluation Studies 

We compare the network design costs for capped hose 
models using the routing strategies SP and HH. We solve the 
associated optimization problems (RNDgp, RNDhh) across 
many instances of the capped hose traffic model. For the 
purposes of this study, we implemented an exact method for 
SP, and employed our BTSM heuristic in the case of HH 
routing. 

For the evaluation set-up, we use two carrier network 
topologies: the American backbone network Abilene (11 
nodes, 14 edges), and the Australian telecom network Telstra 
ll26l (104 nodes, 151 edges). We assume that per-unit link 
capacity costs are proportional to physical distances so we 
use this as our cost vector c for determining the best RND 
solution. Our traffic data is based both on randomly generated 
traffic instances within the capped hose model constraints, as 
well as on a traffic scenario based on real data. 

For the simulated traffic scenarios, for each network we 
randomly generate many traffic instances (each corresponding 
to a capped hose poly tope) on which to solve RND. That is, 
each instance arises from some collection of peak demands 
and marginals. Our random instances are generated starting 
from actual population statistics (see Section fV-A) . In addition 
on Abilene, we use real data based on previous work from 
ll27l which gives a time series of point-to-point demand 
measurements that can be used to create capped hose models 
(described later). 

Our results show that HH templates are often the most 
cost-effective. That is, RNDhh is very often less than both 
RNDsp, and RNDhub- In particular, this means that by 
adding peak capacities U(i,j) into the hose model, the single- 
hub tree-routing template is no longer cost-effective. We need 
additional hubs for a cost-effective network design. This is 
strong evidence for the use of HH routing. 

A. Generating Traffic 

To generate our traffic demand matrices, namely marginals 
U(i) and peak demands U (i, j), we try to mimic how traffic is 
naturally generated in todays core networks. It has been widely 
accepted that core traffic is stochastic in nature ("bursty") 
and that peak demand is much larger than average demand. 
This change in traffic patterns presents a moving target for 
service providers. The uncertainty is due to novel content- 
based network applications combined with factors such as data 
center consolidations and content mobility (content migrating 
from location to location based on where they are consumed). 
Therefore in our evaluation scenario, we first assume peak 
demands, and then impose marginals based on the equipment 



choice at each node (capacity and number of ports at the 
ingress and egress nodes). Our process is anchored by first 
fixing the peak demand between every pair of node locations 
based on their population |28l . This gravity model takes into 
account the population size of the two sites as well as their 
distance. Since larger sites attract more traffic and closer 
proximity can lead to greater attraction, the gravity model 
incorporates these two features. The relative traffic between 
two places is determined by multiplying the population of one 
city by the population of the other city, and then dividing the 
product by the distance between the two cities squared. In our 
case, the peak demand between two cities i.j is 

U(i,j) — a(i,j) Population(i) Population(j) (4) 

where a(i,j) = l/distance(i, j) and distance^, j) is the 
geographical distance between node i and node j. 

As datacenter based traffic dominates over the Internet, 
significant research activity is underway to learn the char- 
acteristics of this traffic. However, as noted in |29| we see 
that the end-to-end traffic matrix showing how the total traffic 
to YouTube video-servers are divided among different data 
centers follows the gravity model. On the other hand, the entry- 
exit traffic matrix seen by the ISP suggests that data-center 
traffic flows are not getting divided to different locations. This 
is due to the impact of BGP routing policies employed by the 
ISP to handle datacenter-bound traffic. 
Generating marginals 

To generate marginals, we propose sampling within the 
individual relevance intervals, as discussed in Section |II-B| 
We start by discretizing U(i)'s relevance interval into s > 
steps, for each of the n nodes i. Then, for each value <7j G 
{0, l,2...,s} we get a U(i) within its relevance intervals. 
Formally: 

U(i) = maxU(i,j) + — (Vc/(y) -maxU(i,j)). 
j s j 

j 

Thus each vector u <E {0, 1, ... , s} n yields an instantiation 
of the marginals. For fixed s and n, this gives (s + 1)" pos- 
sible data points, yielding all possible marginals within their 
relevance interval up to a precision based on the coarseness 
of the discretization (size of s). 

It is impractical to generate all (s + l) n points for large 
n and s. To bypass this, we use a smaller value k in place 
of n to generate (s + l) k points. We then sample uniformly 
from these a to assign the values for each node, i.e. we mimic 
the distribution of the short vector a of length k to create our 
longer marginal vector of length n. So we get 

UU) = m&xU(iJ)+ C ^( s rU(iJ)--miixU{i,j)). (5) 

3 S t— 1 j 

3 

where a E {0,1, . . . , s} k and U[k] is chosen uniformly at 
random from {1,2,..., k}. 

B. Time-Series Traffic Matrix Data 

For the Abilene network, we use the estimation of point- 
to-point demands from the work of ll27l to create a realistic 
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Fig. 3. Going from ij to ij+i, always through e = st and along shortest 
paths. 

data point. The data consists of 24 weeks worth of traffic 
matrices sampled each five minutes. In other words, we get 
a sequence {Djj}?^ (T = 48384) of demands from i to 
j for all nodes i,j in the network. We transform this data 
into marginals and peak demands for a capped hose model 
as follows. The peak demand between two nodes i,j is by 
definition U(i,j) — max t Dj y Similarly, node i's marginal is 
the (minimum) capacity required to route the maximum traffic 
out of that node at a given time step: U(t) = max t . D\ y 
We consider different sampling intervals, i.e. various T's. 

C. Computation of Link Costs 

It is simple to compute the capacity u(e) needed on each 
edge e for HH - we just add up the capacities from every 
hub tree edge / = ij for which the r](i)r)(j) -path passes 
through e. For SP, we follow a similar sub-optimization as 
used for HH templates. Namely, we find the maximum value 
of u(e) = Y^ij Wij(e)Dij over all demand matrices D in our 
capped hose universe T> where Wij(e) is 1 if Py uses edge e, 
and is otherwise. For any edge e = st, it is straightforward 
to see that the set of ij's with Wij(e) = 1 induces a bipartite 
graph H — (V(G), {ij : aiy = 1}) when the P^-'s are shortest 
paths. Suppose H is not bipartite, then there is an odd cycle 
C = i±i 2 ■ ■ ■ t2k+i- Thus, w i2k+lll = 1 and w ijij+1 = 1 for 
all 1 < 3 ' < 2fc + 1. Without loss of generality, the fact that 
Pjj's are shortest paths implies that ij is closer to s for odd j's 
and closer t for even j's (see Figure 3). But then this means 
the path P- 2k+lil = SP G (i 2k +i,s) U SP G (Mi) is strictly 
shorter than Pi 2k+1 .i 1 as it doesn't use e; contradiction. Hence 
we may use a max flow routing algorithm to compute u(e). 
The computation of link capacities can thus be done with a 
collection of all shortest paths {Py} generated using standard 
algorithms. 

Algorithm 2 illustrates our experimental procedure. 

D. Results 

In Figures 2 and 3, we plot the results of the low-cost 
routing templates for two different network topologies Abilene 
and Telstraj Traffic matrix instances are plotted according to 
their marginal and peak demand strengths and coloured based 
on the ratio between the edge cost of the (optimal) SP routing 
template and the HH routing template. A blue point indicates 
that SP is better and a red point indicates that HH is better. 

8 A much smaller number of points were sampled to cover most of the 
spectrum due to the computational constraints caused by the size of the Telstra 
network. 



Algorithm 2 Experimental procedure. 

Require: Network graph G = (V, E) (where edge costs 
correspond to the geographical distance between edge's 
endpoints) and parameters s, k e N. 

for each pair of node cities i,j G V do 

Fix point-to-point peak demand between i and j as per 

Equation Q. 
end for 

for each a £ (s + l) k do 
for each i e V do 

Fix node i's marginal as per Equation |5]). 
end for 

This gives a capped hose model cH arising from 
marginals U(i)'s and peak demands U(i,j)'s. 
Find marginal strength vector /i of cH. 
Find peak demand strength vector tt of cH. 
Solve RND S p (G,c"H) optimally. 

Use our BTSM heuristic and Olver et al's algorithm 
lfT51l to approximately solve RNDhh(G, cH) and get a 
solution of cost RND' HH (G, cH). 
Report || M || 2 , ||7r|| 2 and R R N N D D f ( gg . 
end for 



Moreover, the intensity of the colouring is proportional to the 
proximity of the ratio to one. If the costs of SP and HH are 
very similar, the color is almost white, otherwise it is highly 
saturated 

Intuitively, and as discussed in Section |II-C| the i-axis 
corresponds to the affinity of the traffic to the hose model. 
A large fj, value means more constraining marginals. Thus 
the optimal routing should primarily consider the marginals 
only. So we expect hub routing to do much better. Similarly, 
the y-axis corresponds to the affinity of the traffic to the 
peak-demand model. Again, a large tt value means more 
constraining peak demands, and thus we expect SP to do much 
better. Furthermore, there is a duality between the [i and 7r 
values: a demand universe can only have strong marginals or 
strong peak demands, not both. This explains the quarter circle 
"swoosh" shape of the plot. 

VI. Discussion 

In (3), hub routing was used to compare the cost of two 
architectures: one based on IP routing and one employing 
circuits. Our work focuses on how the link and/or node costs 
vary with the choice of routing template. We now address 
some of the issues and our findings. 

A. \i — 7r as indicators for the choice of a routing template 

We see that there is a continuum of routing templates from 
HH to SP if viewed as a function of the newly introduced 
p, and tt measures. As such we propose that it is possible to 
define a routing indicator that accounts for the strengths of 
the marginals and peak demands and use this information to 
choose the appropriate routing template. This is an important 
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Fig. 4. The ratio between the edge cost of the (optimal) SP routing and the 
HH routing found with our heuristic plotted against the norms of the peak 
demand and marginal strength vectors for varying traffic, and the Abilene 
network. Data points from the time-series of traffic matrices added, with 
duration considered as label. Better viewed in color. 



Ratio SP/HH of Total Edge Cost for Telstra 
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Fig. 5. The ratio between the edge cost of the (optimal) SP routing and 
the HH routing found with our heuristic plotted against the norms of the 
peak demand and marginal strength vectors for varying traffic, and the Telstra 
network. Better viewed in color. 

observation because it suggests a new strategy for service 
providers faced with designing for changing demands. In 
addition to just using marginal demands, they can sample 
the demand space to determine the relative strengths of the 
peak and marginal demands. They may use this to choose an 
initial routing template and if necessary, evolve the template 
to obtain/maintain cost-efficient routing. This further justifies 
the need to measure link level traffic, both peak and average 
demand, since it can feedback into the selection of an optimal 
routing template for a given network topology. 

An important consideration is the sampling interval. The 
impact of the sampling interval can be clearly seen by the 
evolution of the real data points in Figures 4 and 5. Each 



TABLE I 

Number of Instances where HH Outperforms SP 



Network 


HH Optimal 


HH Optimal 




Multiple Hubs 


Single Hub 


Abilene (total number of instances: 6561) 


3045 


8 


Telstra (total number of instances: 353) 


246 






such data point comes from a compilation of the real traffic 
matrices over a certain period of time, as labelled next to 
the points. As the interval increases from one week to four, 
we see the capped hose model shift toward a more hose-like 
universe, with stronger marginal (and consequently weaker 
peak demands). If samples are over a short time horizon, then 
the demand matrix does not change significantly and as such 
we are designing for a single demand. In this scenario, SP 
yields the cheapest routing template. However as the time 
horizon gets longer, the different demand matrices could be 
significantly different and in such scenarios the more flexible 
design of HH/VPN brings in the added cost benefits. 

B. Single versus Multi Hub Routing Templates 

An interesting observation of this study is that there are 
significant cost benefits for using a multi-hub routing template 
versus the single-hub templates which yield optimal VPNs. 
Table U shows the number of traffic instances where HH is 
more cost effective than a single hub VPN solution. It is 
straightforward to see that a best solution induced by any hub 
tree is always as good (in terms of total cost) as a single- 
hub routing. Indeed, mapping all internal nodes of a hub tree 
to a single network node is always a valid hub placement 
that yields the single-hub VPN-like routing. Since the hub 
placement algorithm |[T5l is optimal, it is guaranteed to find a 
hub placement that is as good as the best single-hub routing. 

C. Routing Costs and Impact on Network Design 

We evaluate the cost-effectiveness of the HH template 
against SP using both link and node costs. Link or edge 
capacity costs for HH are computed by solving a maximiza- 
tion problem over the universe of traffic matrices. For node 
costs we use the maximum possible link capacities that are 
incident on a particular node. The intuition behind using these 
maximum incident capacities is based on the hardware port 
requirements for routing/switching the total traffic through that 
node. However it is very rare for the all incident links at the 
node to simultaneously carry their maximum allowable traffic 
rates. 

From Figures 4 and 6 for link and node costs respectively, 
we see that HH shows significant cost savings for link costs 
since we take advantage of dense traffic sharing points through 
appropriate location of hub nodes. However, since the node 
costs are being evaluated using physical link capacities the 
gains are not as significant. 

VII. Conclusion 

The capped hose model introduced in this paper shines light 
on the applicability of current routing template archetypes 
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Fig. 6. The ratio between the node cost of the (optimal) SP routing and the 
HH routing found with our heuristic plotted against the norms of the peak 
demand and marginal strength vectors for varying traffic, and the Abilene 
network. Data points from the time-series of traffic matrices added, with 
duration considered as label. Better viewed in color. 

(HUB and SP) parametrized by the marginal and peak de- 
mands of the given traffic matrices. We have shown that 
using the capped hose model provides additional flexibility 
to network engineers for cost-optimal network design and 
shows the relevance of the newly introduced HH templates. In 
particular, this means that by adding peak capacities U(i,j) 
into the hose model, the single-hub tree routing template is no 
longer cost-effective. Moreover, the peak demand and marginal 
strengths presented here are a first attempt at choosing routing 
templates based solely on traffic information. Finally, the 
RNDhh problem is of theoretical interest by itself, encom- 
passing many other well-known algorithmic problems, and is 
open to further investigation. 
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